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Abstract. There is a deep connection between the behavior of the maximal 
C ~ ) ' variation of posterior probabilities over a set of states K and asymptotic prop- 

£N) ' erties of repeated games with incomplete information. The paper is concerned 

with a case of "big" K and specific effects arising in this case (a previous study 
was mainly focused on finite K). It turns out that for priors with heavy tails 
the maximal variation and a game value both can grow anomalously fast. A 
family of functionals (including the Shannon entropy) measuring uncertainty 
of prior distribution naturally arise to characterize this phenomenon. 
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1. Game-theoretical objects. 



H 

■ Repeated two-person zero-sum games with incomplete information on one side 

(hereafter RGII) were introduced by R. Aumann and M. Maschler (see PQ,[7]). The 
q ' main feature of RGII is a repeated interaction between players forming a mechanism 

of information transmission from the informed player (PI) to the uninformed player 
(P2). PI knows what game they are actually playing and P2 does not know, so 
he tries to guess the state of art observing PI actions. Hence PI should use his 
strategic advantage not revealing his private information too fast. 
G\ ; RGII T N (p) is given with: a collection T = (K, S, T, {gk}k£K ), where if is a set 

of states, S and T are action sets of PI and P2, : 5 x T -> R is a one stage pay- 
off function at the state fc; a measure p £ A (if) (A(A) is the set of all probability 



measures over A) and a number of stages N 6 N. To simplify the consideration 



and to avoid measurability conditions we assume that K, S and T are finite or 
countable. Before the game starts, a state k £ K is chosen randomly according to 
p. PI is informed on the state fc; P2 is not. At n-th stage of the game (n = 1, . . . , N) 
PI and P2 simultaneously select their actions s n € S and t n € T using information 
they have at this stage, then the actions are announced. At the end of the game PI 
receives, and P2 loses ££=iffk(«n,*n). Let h n = {(s h U)}^ 1 e (S x T)™ -1 = H n 
be the history up to a stage n (Hi = {0}). A (randomized) strategy of PI is 
a : K x H -> A(S), and a strategy of P2 is r : H -> A(T), where H = \_£ =1 H n . 
In other words, at each stage the strategy maps the information that player has 
to the randomized action. PI wants to maximize his average gain Gn(p,ct,t) — 

H£p,o\r Sn=i 9k( s n, tn) regardless of the opponent's strategy. The expectation is 
taken with respect to the probability measure Pp,cr,T over K x Hn generated by the 
strategies and p. 

One says that the game has a value v [Tjv(p)] if for any e > there are strate- 
gies a £ and t £ such that Vr Gn(p, c e , t) > i;[rjv(^)] — £ and Vcr Gn(p, f, r E ) < 
v [Tjy(p)] + £• The strategies a E and r e are e-optimal strategies (optimal strategies 
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if e = is admissible). In particular, if S, T and K arc finite, then the players 
have optimal strategies (all classical results correspond to this situation). If K is 
countable with finite S and T, then there are e-optimal strategies. For infinite S 
and T e-optimal strategies may fail to exist. 

Let r^ R (p) be a so-called non-revealing game associated with T. The only differ- 
ence from one-stage game Ti(p) is that PI "forgets" k, i.e., the class of his strategies 
is restricted to maps H = H i — > A (5). 

Denote by T(K) the set of collections T such that: the set of states is K; S 
and T are finite or countable; \\g\\ = sup fe s t \gk(s,t)\ < oo; for any p e A(if) and 
N G N there are e-optimal strategies in the games Tjv(p) and r^ R (p); the value of 
the non- revealing game w[r^ R (p)] = (it means that the only strategic advantage 
of PI in Tjf(p) is his private information). 

One of the main problems of the theory is a game value asymptotic behavior as 
N —> oo. In the case of < oo, denotes the number of elements) it is known 
that for games from T(K) the game value can't grow faster than TV 1 / 2 in power-law 
scale. There are examples of games with such speed of value growth. The aim of 
this note is to describe the maximal speed of v [Tjv(p)] growth for games from T(K) 
without finiteness assumption. Consider the following quantity 

, \ ln(" Fiv(p)]) 1 
a{p) = sup hmsup — - 

rer(if) N-*ao in-fV z 

(1/2 is subtracted to get a(p) = in the classical case #K < oo). It turns out that 
for infinite K an "anomalous growth" is possible, i.e., a(p) > 0. The main focus 
will be on countable K, but the approach can be generalized to an arbitrary Polish 
space. 



2. Maximal variations and game values. 

Let p n be the posterior distribution of k given a history h n , i.e., p n (A) = 
^p,a, T {{k £ A} | h n ) and p\ = p. The sequence {p n } n =i forms a martingale 
with respect to the filtration generated by h n (i.e., {p„(A)}^ r = j 1 1 is a martingale for 
any A C K). 

Let us introduce the following objects: the TV-term variation of a sequence of ran- 
dom measures {p n }n=i over K is defined by Vjv({^n}) = E Sn=i \^n+i ~ Mn| (1*1 
denotes the total variation of a signed measure $); Ai°°(p) is the set of all martin- 
gale sequences of random probability measures {pn}^! over K with non-random 
pi = p (a probability space and a filtration are not fixed); Ai D (p) C A4°°(p), 
D EN\{1} consists of measure- valued martingales admitted to the filtration gen- 
erated by the sequence of random variables with the number of different values 
less than D; the maximal variation £^(p) = sup{Vat({^„}) | {p n } G M D (p)}, 
De (iV\{l})U{oo}. 

If r G T(K) and K, S and T are finite, then the following estimate holds 
v [Tjy(p)] < ||5||Vjv({p})i where {p n } corresponds to the optimal strategy of PI 
(this estimate is due to R. Aumann and M. Maschler, see [l],[2]). Without finiteness 
condition (there may be no optimal strategies) one can get v \Tn(p)] < ll5ll^ S (/ 3 )! 
i.e., the maximal variation bounds the value from above. 

For the Bernoulli distribution p = (p, 1 — p) (over two-element K) J. Mortens 
& S. Zamir [2] and B. De Meyer [4] derived an explicit asymptotic formula for 
the maximal variation £n(p) ~ ^VN(j){x p ), N — > oo, where <j>{x p ) is the density 
of the standard normal distribution at its p-quantile. In 3 2 x 2-games (i.e., 
#5 = #T = 2) were constructed such that v [Tjv(p)] ~ £n{p) (generalized in [3] to 
rn x m-games, m G N). Thus for j^K — 2 one has a(p) = 0. The same holds even 
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for finite or countable K and p with not too heavy tails, since (see Q],[I]) 

Cn(p) < Viv^ Vp({im-p({i}))- 

l£K 

In A. Neyman's paper [5], that is very close to this note in basic ideas, a variation 
was estimated by the Shannon entropy: 

Hence the set of priors p with a(p) — was significantly extended. But it was 
unknown what happens if S(p) = oo, and can the class with a{p) — be even 
wider. 

3. Estimates of variation growth. 

Consider a more general setting. Let K, be the unit interval [0, 1] (or an arbitrary 
Polish space); let p = 9 p c + (1 — 9)pd e A(/C), 9 e [0, 1] be a "prior" distribution 
decomposed into the continuous component and the discrete one. Similarly to the 
definition above one can define the maximal variation £,®(p) {{p n } is a A(/C)-valued 
martingale if for any measurable A C K. the sequence {p n (A)} is a martingale). 
The case p = pd corresponds to finite or countable K (one needs to enumerate 
atoms of p by elements of K). Let us define the family of uncertainty measures of 
Pd 

1 



Zs(Pd) = V Pd ({l}) 



leK: Pd({i})>0 



In 



Pd({l}) 



5 < 1/2. 



Theorem. For any D £ (N \ {1}) U {oo} the following two-sided estimate holds 

(1) ^(P)>0^(pc) + (l-9)^(p d ), 

(2) tN(p)<H8(Pc)+&(pt)+&(b e ), 

where p^ = 9Si a + (1 — 9)pd, 8i is the Dirac measure concentrated at a point 
la G /C that is not an atom of pd , and be is the Bernoulli distribution with success 
probability 9. There is an explicit formula for purely continuous prior distribution 



(3) mpc) = 2N^i- D 

If purely discrete prior distribution satisfies Z@{pd) < oo for some (3 € [0, 1/2], then 

(4) t°(p d )<2V6N^Zp(p d ); 
if Z-y(pd) = oo for some 7 € (0, 1/2), then for any 7' < 7 

(5) hmsup^( j0(i )/A^ +7 ' =00. 

JV-s-oo 



Remark (main ingredients of the proof). The estimates ([T]) and ^ are conse- 
quences of the mutual singularity of the components pd and p c and the martingale 
property. To prove ([3]) for finite D one can construct a martingale {^^^=1 & 
M D (pc) such that Vat({7t„}) = 2N(1 - l/D). For any v e A(/C) denote by v d a 
normalized restriction of v to a subset B d c K, (i.e., u d ( ■) = u(- n B d )/u{B d )), 
where {B d }® =1 is some disjoint measurable partition of K, such that v{B d ) = l/D. 
Let { / K n ]'^ = i be the Markov chain with a state space A(/C), initial state iri = p c , 
and transition probabilities P(-K n+ i = Tr d \ ir n ) — l/D, d — 1,..,D. Then 
for any N e N one gets £%{p c ) > Vjv({7r n }) = 2N(l - l/D). The inequality 
£n(Pc) — 2N(1 — l/D) follows from the martingale property. 
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The estimate (U) is based on the representation of the variation of a scalar 
martingale derived by B. De Meyer in [3]. Combining this representation with 
large deviation estimates (similar to S. Bernstein inequalities) one can deduce ([4]). 

To prove ^ one can adapt the construction of {7T n }'^L l to the discrete case. For 
discrete v e A(/C) a partition {B^} may fail to exist, but a partition "close" to 
{Bf} may be chosen. To control that the corresponding martingale {7r n } is "close" 
to {Tr n }%Li recurrent estimates are necessary, and they lead to (0. 

4. The game with anomalous value growth. 

It is shown above that £^ (p) can grow anomalously (faster than V~N), but this 
doesn't imply the existence of games from T(K) with anomalous value growth 
{a(p) > 0). Here an example of such game (Sjv(p) is constructed. Let go(0, 0) = 
-!7o(l,0) - 3, 00(1,1) - -9o(0,l) = 1 and ffl (0,0) - -$i(l,0) = <?i(l,l) = 
—51(0, 1) = 2. Corresponding 2 x 2-RGII with action sets and set of states coinciding 
with {0, 1} = Z2 was investigated in [3] and is mentioned above. 

Suppose that K is finite or countable. Let us describe the RGII &n(p) informally. 
Each stage of this game n = 1, . . . , N is suitable to represent as being played in 
two steps. At the first step PI selects a finite subset X C K, and P2 selects finite 
Y C K, then X n Y is told to P2. At the second step if k G X, then they play in 
2 x 2-game corresponding to gi, and to go if k G K \ X (that is PI and P2 select 
h and / from Z 2 , respectively, and the contribution of this stage to the total gain 
of PI is g x ix,k){h>, /), where k) = 1 if k G X, and otherwise). Hence &n{p) 
corresponds to the collection © = (K,S,T,g), where S = {(X,h) E 2 K x Z 2 | 
#A < 00}, T = {(Y, f), Y G 2 K , f : 2 Y -> Z 2 | #Y < 00, }, and pay-off function 
0fe((A, h), (Y, /)) = g x (x,k)(h, f(Xf]Y)). Note that S and T are countable or finite. 

Using the idea of reduction of RGII to the martingale optimization problem 
from B. De Meyer's paper [S] one can prove the following theorem. 

Theorem. The collection 25 € T(K), and for any p G A(i^) and N £ N the value 
of the corresponding game v [&n(p)} > £ivG°)/4- 

Corollary. For countable K and p G A(JsT) not concentrated at one point 
a{p)=mi{5e [0,1/2] | Z 5 {p) < 00}. 

The construction of <Sjv(/j) can be generalized to arbitrary Polish space K. Note 
that the infiniteness of action sets is necessary for the anomalous growth: if Y G 
T{K) and action sets S and T are finite, then V<5 > v [T N (p)\ = o^N 1 / 2 ^). 
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